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Within the Dempster-Shafer theory of evidence a non-Kolmogorovian kind of epistemic uncer¬ 
tainty arises, which is encoded using multi-valued maps. We analyse the possible implications 
such non-Kolmogorovian epistemic uncertainty may have for Bell-type inequalities relating to the 
Einstein-Podolsky-Rosen-Bohm (EPRB) thought experiment. Our analysis leads to a notion of con- 
textuality concerning complexes of physical measurement conditions. The use of multi-valued maps 
reveals an implicit link between this contextuality and counterfactual outcomes, and results in a 
formulation wherein the states of measurement devices are explicitly taken into account as part of 
the probabilistic event space. This reflects a conception of measurement that was advocated by Bell 
some time ago. It results in context-conditioned measure-theoretic probabilities, which do not obey 
Bell-type inequalities, but which are nonetheless perfectly compatible with local classical physical 
models. We give an example of a local classical model that reproduces the quantum mechanical 
predictions and that fits within the contextual framework. 
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I. INTRODUCTION 

Although we have recently celebrated the fiftieth year 
since Bell’s original work [1], Bell-type inequalities re¬ 
main a contentious issue [2]. From a mathematical per¬ 
spective Bell-type inequalities merely express constraints 
on random variables defined over a single Kolmogorov 
probability space. It is quite remarkable that such seem¬ 
ingly innocuous and straightforward results of probabil¬ 
ity theory have produced such heated and protracted de¬ 
bates within physics [3, 4] . In the past Stapp claimed that 
Bell’s inequality is “the most profound discovery of sci¬ 
ence” [5]. On the other hand, more recently Khrennikov 
has claimed that “The only value of Bell’s arguments was 
the great stimulation of experimental technologies for en¬ 
tangled photons. ” [6] . It seems that those who agree with 
Stapp tend to be physicists, while those who agree with 
Khrennikov tend to be probability theorists. 

In this paper we will argue that a single Kolmogorov 
space is very narrow as the basis for the treatment of 
realistic experiments. This limits the power of Bell- 
type inequalities in constraining the interpretation of 
physical models quite considerably. Our alternative ap¬ 
proach utilises concepts from the Dempster-Shafer theory 
of evidence [7, 8]. Unlike Kolmogorovian probabilities 
Dempster-Shafer probabilities are not additive, which 
makes the Dempster-Shafer theory more general [7, 8]. 
The main use we find for this theory is in the identifi¬ 
cation of a non-Kolmogorovian kind of epistemic uncer¬ 
tainty. This uncertainty is associated with a probabilistic 
event space whenever two or more observables are oper¬ 
ationally incompatible. It is encoded using multi-valued 
maps, which result in the replacement of a single Kol¬ 
mogorov space with multiple Kolmogorov spaces, each 
labeled by distinct measurement contexts. Within the 
treatment of EPRB-type experiments, the multi-valued 
maps allow us to establish a link between measurement 
contexts and counterfactual outcomes. When hidden 


variables are considered the multi-valued maps result in 
a contextual approach similar to Khrennikov’s [6, 9]. 

We begin in section II by considering Bell-type inequal¬ 
ities involving empirical data from a table. Such inequali¬ 
ties are based on phenomenological arguments alone and 
make no explicit reference to hidden-variables [10, 11]. 
Instead they concern counterfactual outcomes of hypo¬ 
thetical measurements. Using the Dempster-Shafer the¬ 
ory enables us to probabilistically treat counterfactual 
outcomes differently to factual outcomes. The proba¬ 
bilistic uncertainty associated with the former is non- 
Kolmogorovian (despite being purely epistemic), while 
the uncertainty in the latter is of the usual Kolmogoro¬ 
vian variety. In section III we provide a treatment involv¬ 
ing hidden-variables wherein multi-valued maps provide 
contextual observables and context-conditioned proba¬ 
bilities. These quantities are not constrained by Bell- 
type inequalities. In section IV we relate our contextual 
approach to conventional approaches both rigorous and 
heuristic, and assess the significance of Bell’s locality as¬ 
sumption. In section V we give an explicit example of a 
classical, deterministic, local model of an EPRB-type ex¬ 
periment that fits within our contextual approach, and 
that reproduces the quantum predictions. Finally, we 
summarise our findings in section VI. 


II. THE EPRB EXPERIMENT 

In Bohm’s version of the EPR experiment there are 
two spin-half particles produced by a common source. 
These particles travel in opposite directions labeled left 
and right. The spin of the left particle is measured us¬ 
ing a Stern-Gerlach (SG) device aligned along one of two 
possible directions o or a', and the right particle is mea¬ 
sured by a similar SG device aligned along either b or b'. 
The generalised Bell-type CHSH inequalities refer to the 
following theorem [12]. 
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Theorem. Let (H, E,/r) be a Kolmogorov probability 
space. Let real random variables A, A! : H —> {±} rep¬ 
resent the spin observables of the left particle along di¬ 
rections a and a' respectively, and let B, B' : fi —{±} 
represent the spin observables for the right particle along 
b and b' respectively. The following Clauser-Horne- 
Shimony-Holt (CHSH) inequalities hold; 

|/| := | (AB) + (AB') - (A'B) + (A'B')\ < 2 (1) 

where the correlations are defined by 

(CC 1 ) := f dn{u)C{u)C'(u). ( 2 ) 

Jn 

The proof is almost trivial, but is omitted for brevity. 


A. Phenomenological treatments and the 
Dempster-Shafer theory 

In a typical EPRB experiment measurements of the 
spins of an ensemble of N particle pairs results in a table 
of values such as that given in I. We let Ho denote the set 
of 2 N outcomes explicitly appearing in such a table. We 
denote by Hkpv the set of values (KPVs) that are known 
to have been possessed by the particles at the time each 
particle was measured. Trivially, any reasonable theory 
of physics allows us to set Ho = Hkpv- 

A standard argument that a local deterministic classi¬ 
cal theory cannot violate the CHSH inequalities proceeds 
as follows. Since, in a deterministic classical theory, the 
particles are assumed to possess spin values in all direc¬ 
tions whether or not they have been measured, one can 
imagine that the blank entries in the table I are in fact 
filled with variables which can take values + or —. Let 
us denote by Hupv the set of unknown spin values pos¬ 
sessed by the particles (UPVs). There are 2 2N concrete 
choices that can be made for Hypy as a se t of 2 N values 
each of which is either + or —. These different choices 
are labeled i = 1, ...,2 2JV . Each H^g represents a 

set of numbers called counterfactual outcomes. If, con¬ 
trary to the actual experiment that we are describing, we 
supposed that somehow all four spin directions a, a', b , b' 
were measured for each particle pair, then we would ob¬ 
tain a complete table H* = Ho U Hg 0 where the index i 
takes a fixed value within 1,..., 2 2N . Each ST comprises a 
complete table of outcomes, and each table can be viewed 
as a Kolmogorov space ST in which the probability for an 
outcome is simply it’s frequency of occurrence within the 
table. 

However, attempting to encode local classical deter¬ 
minism by making a concrete choice for Hypy as one of 
the Hqq conflates physical and probabilistic assumptions 
in way that may not be justified. Such an assumption 
is much stronger than classical determinism, because it 
assumes not only that there exist possessed values, but 
also that these values are known from having performed 


Run/ 

Particle pair 

Left particle 
a a' 

Right particle 
b b' 

1 

+ 

? 

+ 

? 

2 

? 

+ 

? _ 

3 

- 

? 

+ 

? 

4 

? 


_ ? 

N 


(+) 

(-) 

+ 


TABLE 1: A typical table of data collected in an EPRB exper¬ 
iment. Each run consists of one spin measurement on each 
particle. Each spin measurement is in one of two possible 
directions— a or a' for the left particle, and b or b' for the 
right particle. The set of all entries is denoted SI. The set of 
all entries containing either + or — is denoted SIkpv- The set 
of all entries containing a ? is denoted SIupv- In a complete 
table SIkpv = SI and SIupv = 0, so in a complete table the 
unknown entries would also contain outcomes ±. An example 
is given by the outcomes in brackets in the IV’th row. Since 
such outcomes do not exist in any table of data produced by 
an actual experiment, they are termed counterfactual. There 
are 2 2N possible permutations of 2 N variables j = ±. Each 
permutation comprises a set SIq 0 , i = 1,..., 2 2JV . For fixed i 
the set SIco gives one possible way to fill in the blank entries 
with counterfactual outcomes ±. 


measurements. The assumption that particles can simul¬ 
taneously possess definite spin values in different direc¬ 
tions, does not by itself mean that we should assign rel¬ 
ative frequencies to both KPVs and UPVs , as though 
both are measurement outcomes. Only KPVs should be 
treated as measurement outcomes. Put slightly differ¬ 
ently, the assumption that possessed values merely exist 
does not mean that we should necessarily treat the two 
sets Hkpv = Ho and Hypy as part of the same prob¬ 
ability space. Such a probabilistic restriction entails a 
physical constraint which is different to the one we seek 
to impose. 

Within a single Kolmogorov probability space all prob¬ 
abilities whether they are taken to pertain to UPVs or 
to KPVs are required to satisfy the same rules. Since 
UPVs are unknown, their probability assignments must 
be subjective. On the other hand KPVs appearing ex¬ 
plicitly in a table are afforded probability assignments 
in the form of objective relative frequencies. Clearly a 
single Kolmogorov space is too limited to even allow for 
the possibility that these two distinct types of probability 
might be treated differently. However, if we employ the 
Dempster-Shafer theory, such a distinction becomes pos¬ 
sible. Subjective UPV-probabilities and objective KPV- 
frequencies are generally viewed differently. The uncer¬ 
tainty associated with UPVs is interpreted as uncertainty 
in the underlying event space and is non-Kolmogorovian, 
although it is entirely epistemic. 
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Rather than viewing table I as defining 2 2N Kol¬ 
mogorov spaces, we instead view it as defining a single 
Dempster-Shafer probability space Q := Hkpv U fiupv 
in a sense that will become clear in what follows. The 
basic idea in our approach is to associate a multi-valued 
map T c with each experimental context c. A context c 
is defined as a particular experimental arrangement of 
measurement devices, preparation devices etc. Thus, an 
experiment involving several measurement contexts actu¬ 
ally corresponds to several sub-experiments each associ¬ 
ated with a different context. If we combine probabilities 
conditioned on different contexts it is quite possible to vi¬ 
olate the CHSH inequalities. Within any one context c, 
only a subset of possible measurements can actually be 
performed, and we cannot associate definite relative fre¬ 
quencies with outcomes of measurements that cannot be 
made within c. The multi-valued map T c only allows us 
to associate upper and lower subjective probabilities with 
UPVs. In this sense counterfactual outcomes are related 
to measurement contexts through the multi-valued maps. 

In the EPRB setup the different contexts are 
a, a', b , b', ab, a'b , ab', a'b' corresponding to differ¬ 
ent possible settings of the SG devices. The contexts 
a, a' make no reference to the the right particle’s SG de¬ 
vice, and likewise b, b' do not refer to the left particle’s 
SG device. The remaining contexts (namely ab, a'b, ab' 
and a'b') specify an arrangement of both devices simul¬ 
taneously. 

Let us consider first a complete table of known val¬ 
ues = Hkpv for the outcomes of measurements of the 
random variables A,A',B,B' : Q — > {±}. The table 
can be divided into rows v = 1 , ...,7V and columns 
f l c , c = a, a', b, b'. The intersection f \ vc := flj,nf2 c is sim¬ 
ply the singleton set {w„ c } consisting of the z/c’th table 
entry. Since every entry has value + or — we can partition 

in terms of disjoint subsets as = fl + U We can 
now define useful intersections such as f2+, := I2„ c D f2 + , 
which is empty if to^c = — and is equal to f \ vc otherwise. 

Considering only a single particle, we can define the 
following relative frequencies 


1 N 7\r± 

P(C = ±) = P(a£) = -Y, l n icl = -jf (3) 

v V=1 

where C = A,A!,B,B’, c = a,a',b,b'. Here |Sj is used 
to denote the cardinality of the set S , and iV+ = |I2+| 
and N~ = |H“| denote the number of entries in the c’th 
column with value + and value — respectively; iV+ + 
N~ = N. Considering both particles together we can 
define the joint coincidence frequencies 

1 N /\P fc 

P(IP C , n k ) :=-£inieiin^i = ( 4 ) 

where j = =b and k = zb. The above probabilities can be 
used to define individual averages and joint correlations 


as 


j = ± 

(CC') = jf E jkP^o')- (5) 

j,k—± 

In the case that the table is complete 12 = Hkpv the data 
necessarily obeys all Bell-type inequalities. 

Now consider the case in which the table has missing 
entries corresponding to UPVs. The total space can be 
partitioned as before as = I2 + U SI - , but now we also 
have the disjoint partitioning 12 = I2kpvUI2upv- We can 
also form the intersections fi^pv vc := f V vc D I2kpv and 

^UPV,yc * = G IIupv■ The set f^KPV,^c empty if uv c 
is unknown and is equal to f V vc otherwise. Likewise the 
set H|jp V vc is empty if w„ c is known and is equal to I2£ c 
otherwise. 

With each individual context c we associate the multi¬ 
valued map P c : ft c = {J u Q uc —► £{±} defined by 


+ 

W G ^KPV.c’ 


- 

W G fI K PV,c> 

(6) 

{±} 

W G Hupv,c) 



and with each joint context cd we associate the multi¬ 
valued map P cc / : (J, y VL„ c x f2„ c ' —> 5q±} x {±} defined 
by 


' (^c; ^ c' ) ■ — 


(j, k) 

(Wc, W C ') 

{(?,+), (J.-)} 

(ui c ,W c ') 

{{+■]),{- 3)} 

(w c ,Wc-) 

{±} x {±} 

K,Wc-) 


G U v ^KPV,i/c X ^KPV,i/c'i 
G Uji/ ^KPV,i 'c X ^UPV.i/c', 

G Uji/ ^UPV.vc x ^KPV.nc'! 

G U„ ^UPV,i/c x HuPV 

(7) 


These maps give rise to multiple upper and lower sets. 
For example, for j = ± 

j*r c '■= {w c : 0 7^ r c (w c ) C {j}} = f^KPV,c 

j* rc ■= {w c : r c (w c ) D {j} ^ 0} = ^kpv,c u ^upv.c (8) 


and for j, k = ± 


(j- ^)*r cc / — U ^KPV.i/c X ^KPV,i >c' 

V 

(jj fc)* r ° c ' = 

U^KPV, vc U Oupv.i/c) X (^KPV.i/c' G HuPV,!/(=')• 

(9) 


We can now define upper and lower probabilities using 
(3) and (4). Since each multi-valued map is associated 
with a specific context, so too are the upper and lower 
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probabilities. For example, using r c we have 

P.S) := P(j. r.) = 

P“(j) := P(i* r ') = '^cl + IOuPV..! 


( 10 ) 


The difference P* C (Q J C ) — P* c (f2£) = P(f2 upv.c) repre¬ 
sents Dempster’s “don’t know” probability associated 
with Dupv, c - That this quantity is nonzero reflects the 
fact that we cannot reveal any information about the 
value of the random variable C, within the context d . 
More colloquially, we “don’t know” what the values of C 
are, if what we are measuring is C'. 

In the case of measurements on both particles we have 
within the context cd 


N 


P*cc'ti,k) := Pfok). r J = Kpv.JI^kpv.vc'I, 


P* cc (j,k ) := PUk )* r <•] 
1 N 


— jy I^KPV,I 'C U ^UPV,i/c||^KPV,vc' U Oupv.i. 


U=1 


(ID 


In this case the “don’t know” difference is associated with 
the cases in which we do not know the value of C for the 
left-particle or we do not know the value of C' for the 
right-particle. If we restrict our attention to a single one- 
particle context c, then we cannot meaningfully associate 
frequencies with the values of C ^ C. Similarly in the 
two-partic.le case restricted to the context cd we cannot 
meaningfully associate frequencies with a pair of observ¬ 
ables (D, D') for which ff/Corff'^ C'. We can only 
meaningfully give subjective upper and lower probability 
intervals in these cases. Combinations of these subjective 
probabilities are quite capable of violating Bell-type in¬ 
equalities, though the sense in which such combinations 
are really meaningful raises delicate questions. 

There is however, a way to violate the CHSH in¬ 
equalities with meaningful contextual averages, which 
are naturally obtained from the Dempster-Shafer multi¬ 
valued maps. Each context c defines an observable C 
whose domain is the set of points for which T c is single¬ 
valued. The domain of such an observable is called a 
domain of certainty — a notion that plays a central role 
in the Dempster-Shafer treatment we employ. Within 
phenomenological treatments to EPRB-type experiments 
observables with disjoint domains of certainty could be 
termed incompatible. For example, considering only one 
particle we define C : Okpv.c —► {±} by C := F c |kpv- 
In the two-particle case we can also define the prod¬ 
uct variable CC' : (J ;y Dkpv^c x ^kpv,i/c' {A} by 
CC := F^/F^Ikpv, which denotes the product of the 
cartesian components of T cc i restricted to the subset 
U,y f^KPV.yc x ^kpv.i/c'- Any two distinct random vari¬ 
ables so defined are incompatible. With respect to these 


variables averages are only defined over domains of cer¬ 
tainty; 


N 


<c)o = ££ 


3 —± v=l 


■ll^KPV.i/cl 

|^KPV,c| 


N 


M 


/rr ,\ jfcl^KPV ,i/c II^KPV ,vc' 1 f 1 

~jfd ± I fi KPV l(ic X f^KPV.Mc' | 


Substituting these expressions into (1) it is clear that (1) 
can be violated. In fact, the upper bound on |/| becomes 
4 rather than 2. 

Along with the above averages we can define context 
restricted probabilities 


N 


Pc(C = j) :=]T 

V= 1 


I^KPV,t/el 

|^KPV,c| 


P cc '{C = j, C' = k) := 


N 

£ 

v=l 


I ^KPV.wc 11 ^KP V,vc' I 

| Uu ^KPV,mc x ^KPV 


(13) 


The first of these represents the frequency with which the 
outcome C = j is obtained given that the experiment is 
actually set up to measure C, i.e., given that the context 
is c. The second represents the probability that (j, k) is 
obtained given that both C and C' are actually simulta¬ 
neously measured, i.e., given that the context is cd. The 
relevance of this type of conditional probability in rela¬ 
tion to EPRB-type experiments was first pointed out by 
A. Fine [13]. In the treatment above context conditioned 
probabilities and averages generally violate all Bell-type 
inequalities, but nothing about this more general proba¬ 
bilistic treatment precludes the physical assumptions of 
classical determinism and locality. 


III. INCLUDING HIDDEN-VARIABLES 

Most treatments of EPRB-type experiments includ¬ 
ing Bell’s original treatment, start with the assumption 
that for each particle pair we can use hidden-variables to 
give a complete, classically deterministic specification of 
the real experimental state. These hidden-variables are 
assumed to belong to a single Kolmogorov probability 
space. Averages are defined over this one space, and the 
CHSH inequalities (1) necessarily hold. In this section 
we will relax the latter assumption and define contex¬ 
tual random variables whose domains are the domains of 
certainty of specific Dempster-Shafer multi-valued maps. 


A. Rigorous hidden-variable treatment 

First, for comparative purposes, we provide a rigor¬ 
ous formulation of the CHSH inequalities. We con¬ 
sider the standard EPRB setup in which the spins of 
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two particles produced by a common source are mea¬ 
sured. We formulate the present treatment within a sin¬ 
gle Kolmogorov space (fi, Ep, p). We define the variables 
A, A! : fl —> {±} representing spin observables in the di¬ 
rections a, a' for the left particle, and similarly we define 
the spin observables B, B’ : O —> {±} for the right par¬ 
ticle. The directions of these spin observables coincide 
with the contexts a, a', b , b' referring to the SG device 
alignments. We define the sets fl c := C _1 ({±}) and 
+ c := C _1 (+) where C = A, A', B , B' and c = a, a', b, b'. 
We also define the real random vector X : 17 —> {±} 4 by 
X(lo) := (A(co), A'(lo), B(u}), B'(u})). Finally we assume 
that to £ fl can be viewed as giving a complete descrip¬ 
tion of the underlying reality within the experiment, i.e., 
w represents a complete ontic state of the total physi¬ 
cal system. With everything defined as such we can now 
give the two-particle probabilities relevant to Bell-type 
inequalities as 

p(C=j,C'=k)=[ dp(oj). (14) 

J jc<~\k c i 

More generally we have 

l^( x = f dp(w). (15) 

J j a r\k a ,ni b nm b , 

There are 2 4 = 16 permutations of the outcomes 
j,h,l,m = ± appearing in the above expression, giv¬ 
ing the same number of probabilities /i (X = ( j , k, l, to)). 
These probabilities act as a basis in the sense that they 
can be used to express any other (absolute) probability. 
Examples of single-particle and two-particle probabilities 
are given by 

H(A = j) = ^ n (X = ( j , k, l , to)) , 

k,l,m 

M A = j,B = l) =J2v(X = (j, k, l, to)) . (16) 

k,m 

Single-particle and two-particle averages are given by 

(C) := [ dp(oj)C(oj), 

Jn 

(CC') := [ d/i(w)C(w)C'(w). (17) 

Jn 

Despite the fact that for a single particle we cannot si¬ 
multaneously attribute known possessed values to both 
of the observables A and A', equation (17) does not dis¬ 
tinguish between the averages like (AA 1 ), and meaningful 
two-particle correlations such as (AB). Similarly the for¬ 
malism does not itself distinguish between the probabil¬ 
ities p(X = ( j,k,l,m )) in which outcomes are simulta¬ 
neously associated with all observables, and probabilities 
like p(A = j,B = l ), which only associates outcomes 
with observables that can be simultaneously measured. 
In other words, because it has been built using a single 
Kolmogorov space the above hidden-variable approach 


treats counterfactual outcomes in the same way as the 
conventional phenomenological approaches discussed in 
section II A. In short, the above approach assumes that 
counterfactual outcomes can be treated in the same way 
as factual outcomes. It is convenient to refer to this as¬ 
sumption as simply the counterfactual assumption. 

The quantities above can be written in terms of con¬ 
crete probability densities by defining a Borel measur¬ 
able chart A : tt —> K™ consisting of hidden-variables 
that reveal the underlying physical states w £ fl. If 
we denote by £>(R”) the Borel subsets of R", the triple 
(R”,S(R”),Pa := p. o A -1 ) defines a concrete Kol¬ 
mogorov space. One can then define the observables C := 
C o A -1 over this space, and one can define the joint dis¬ 
tribution function Fa : R ra — > [0,1] by Fa(Ai, ..., A„) := 
Pa(Ai < Ai,...,A„ < A„). The probability density asso¬ 
ciated with Fa is defined by p\ ■= d n F^/d\\...d\ n . The 
number pa(A) characterises the observer’s epistemic, but 
purely “Kolmogorovian” uncertainty, as to the underly¬ 
ing physical state w, A(w) = A. This interpretation is in¬ 
herited from the interpretation given to the Kolmogorov 
probability measure p. The assumption of classical de¬ 
terminism means the observable C is assumed to pos¬ 
sess the definite value C(A) in the state A. Note that 
the formalism distinguishes between two distinct types of 
state—an ontic state ui (or equivalently A = A(w)), and 
an epistemic state p\. A simple example of the above 
formalism at work is given later on in section IV A. For 
now we note that equations (14) and (17) can be written 

P A (C = j,C' = k) = [ _ dP A (\)= [ _ d n X PA (X) 

Jjc<~\k c i JjcAk c , 

(18) 

and 

(C)= f d n X PA (X)C(X) 

J R n 

(CC’)= f d n X PA (X)C(X)C'(X) (19) 

JR" 

respectively. Within the framework discussed above the 
CHSH inequalities (1) necessarily hold. 

B. A contextual Dempster-Shafer approach 

We now offer an alternative approach to that above for 
the modeling of EPRB-type experiments. As in section 
IIA we achieve this using multi-valued maps. The re¬ 
sulting framework shares many features in common with 
Khrennikov’s contextual framework [6]. Our starting 
point is the idea that even if we only considered mea¬ 
suring the spin of a single particle in the different direc¬ 
tions c = a, a 1 , a single Kolmogorov probability space 
(f2, Eq, p) would not offer an adequate description of the 
experiment being envisioned. Using only a single space 
makes it impossible to account for the fact that different 
spin directions are not simultaneously measured. The 
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latter is an operational fact, which must be properly 
encoded within any theoretical treatment, regardless of 
whether or not we make particular physical assumptions 
like determinism or locality. Thus, the theory based on 
a single Kolmogorov space must be appended in order 
to properly account for the incompatibility between dif¬ 
ferent experimental contexts. In short, we avoid making 
the counterfactual assumption. 

A single-particle context is assumed to be determined 
by the alignment of the SG device. As both Bohr and 
later Bell repeatedly emphasized should be the case, the 
model of an experiment should describe the entire ex¬ 
periment, rather than just the systems being measured 
[14]. With this in mind we define f l = (J c fl c where 
c = a,a',b,b' are distinct SG device contexts. Each f2 c 
can be decomposed as = + c U — c with the sets ± c 
corresponding to the possible spin outcomes along the 
direction c. 

Just like in section II we associate with each c a multi¬ 
valued map T c : fl —> £{±} defined by 


T c (w) := < 


U±} 


w £ +o 
to £ —c, 
uj fl c . 


( 20 ) 


The uncertainty in the event space within the context c, is 
represented by the set fl c which denotes the complement 
of fl c . We can also define the joint-context multi-valued 
maps T cc , : fl —> £{ ± } by 


jk w£j c nfc c /, 
{±} lo (jL fl c D G c /. 


( 21 ) 


As before we define the random variables Tcjn^ =: C : 
fl c —> {±}, whose domains are the domains of cer¬ 
tainty of the T c . We also define the product variables 
CC' := r cc /|n e nn c /• With respect to these observables 
single-particle and two-particle probabilities are defined 
by 


mOcI^c) 


i 

p(X^c) 





p(jc G k c > |fi c n fl c /) 


1 

p(Ac n fic / ) 



dp(oj)- (22) 


The expressions in (23) are normalised over the domains 
of certainty defined by T c and T cc / respectively, and can 
be interpreted as conditional probabilities. In a similar 
fashion single-particle averages and two particle correla¬ 
tions are defined as 

( C )c ■■= dfj,(w) C(oj), 

( cc ')co’ := * T / WCMC'H (23) 

W^nilc') Jn c nn r , 


conditional expectation. If we substitute correlations of 
the form given in (23) into (1) we obtain as in section 
II an upper bound on |/| of 4 rather than 2. Thus, 
conditional probabilities and correlations do not satisfy 
Bell-type inequalities. Of course, the interpretation of 
quantum probabilities pertaining to EPRB experiments 
as classical conditional probabilities is well-known [9, 15]. 

As in section III A we can map fl into R ra . However, 
since in general we expect the epistemic states of an ob¬ 
server to depend on the context, we now associate with 
each context c a “local” (as opposed to global) chart 
A c : fl c —> R™ defined over the domain of certainty fl c . 
With each joint context cc' we associate another chart 
A cc i : fl c D H c / —> Q c fl Q c i defined over the domain of 
certainty fl fl c /. The local charts A c give rise to mea¬ 
sures P c , epistemic states p c , and observables C defined 
over := A C (H C ) £ £>(R"). Analogous measures, states 
and observables are associated with the joint contexts 
cc 1 . The set is the “concrete” domain of certainty 
in B( R"), that corresponds to the “abstract” domain of 
certainty £ T,q. The probabilities in (22) can now be 
written 


Pc(C = j ) := P c (j c \n c ) := 


Pc(tt c) 


d n \p c (\), 


Pcc'{C = j,C' = k):= P cc '{jc n k c ’\Cl c n Q c ,) 

—=— [ d n \ Pcc'iX). (24) 

p CC '(.n c nn cl ) 


The notation P(-|f2 c ) does not necessarily denote a con¬ 
ventional conditional probability within R”, rather it 
indicates that all probabilities pertaining to context c 
are normalised over Q c . In general one need not re¬ 
quire that P c (f2 c ) < 1. The normalisation factors in 
(24) can be absorbed into the definition of the densi¬ 
ties to yield absolute probabilities over the domains of 
certainty. More precisely, defining P' c := P c /P c (Cl c ) and 
P' cc > ■= Pcc'/PcA^c n f2 c <), we have 


Pc(C = j) = P' c Qc) = f d n \p' c (X), 

Jjc 

PcAc = j, c' = k) = PA(jc n k c t) = 


[ d n x P ' c Ax ). 

(25) 


These normalised measures allow one to define condi¬ 
tional probabilities within the domains of certainty in 
the usual way. The averages in (23) can be written 


(C) c = 


Pc(flc) Jn 

(CC') cc i = 


d n Xp c (X) C(X), 


1 


Pc(Ac G Xl c f) J £i c c\Ci\ 


d n X Pc (X)C(X)C'(X). 

(26) 


where again each expression is normalised over a con- Whenever the joint correlations in (26) can be expressed 
text specific domain of certainty, and so represents a in the form given in (19), the CHSH inequalities in (1) 
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hold. This requires the existence of a common density 
p{ A) corresponding to a global chart A : Q —> R", such 
that every probability is normalised over the total space 
U. Such a description would only generally be appro¬ 
priate if all measurements within the experiment were 
performed under the same physical conditions, that is, 
within the same context. Note that when a global chart 
is used but the probabilities are normalised only over the 
domains of certainty, they can be interpreted as condi¬ 
tional probabilities as in (22). 


C. Discussion 

Two different alignments of a SG device produce alto¬ 
gether different inhomogeneous magnetic fields over the 
spacetime region within which the spin measurement is 
performed. In general, the physical conditions under 
which a measurement is performed are at least partly 
determined by the state of the measuring device, which 
therefore influences the observed physical events. As Bell 
himself puts it: “ the results have to be regarded as the 
joint product of ‘system’ and ‘apparatus’, the complete 
experimental setup” [14]. In a deterministic theory the 
set of all possessed values for the observables of a phys¬ 
ical system can be divided into KPVs and UPVs. The 
KPVs are what Bell calls “results”, while obviously the 
UPVs are not “results”. As such UPVs should not be 
regarded as the joint product of system and apparatus. 
Rather, UPVs depend on the system alone. Moreover, 
in EPRB-type experiments the UPVs pertain to spin ob¬ 
servables that are operationally incompatible with the 
spin observables that are actually measured to give the 
KPVs. Crucially therefore, we cannot assume that the set 
of KPVs is representative of the set of UPVs.. In other 
words, if, in determining measurement outcomes, mea¬ 
surement devices are active rather than passive , then the 
counterfactual assumption is not valid whenever opera¬ 
tionally incompatible observables are being considered. 
Thus, we see that Bell’s own conception of measurement 
cannot generally be reconciled with the counterfactual 
assumption crucial for proving Bell-type inequalities. 

In our approach distinct and incompatible states of a 
measuring device result in different physical contexts c. 
Each context c is associated with a multi-valued map r c . 
With respect to a given context c, the uncertainty associ¬ 
ated with counterfactual events taking place in a different 
context c' ^ c, is non-Kolmogorovian epistemic uncer¬ 
tainty pertaining to the underlying event space. This 
leads to the conditioning of densities on measurement 
contexts, and results in probabilities as in (22). These 
probabilities are defined in terms of different measures 
and are normalised over different domains. As a result 
they are not constrained by Bell-type inequalities. The 
contextual framework given here, offers an explanation 
as to why violations of Bell-type inequalities have actu¬ 
ally been observed in the lab [16] without resorting to 
nonlocality or indeterminism. 


1. Free will 

In mathematical terms the contextual formulation 
above uses multiple Kolmogorov probability spaces to 
model the EPRB experiment. This is deemed appro¬ 
priate because the experiment actually consists of four 
incompatible experimental contexts. Thus, rather than 
using a single measure P corresponding to which there is 
a single density p , we use multiple densities p c , which are 
labeled by context. The assumption that a single density 
suffices, so that 

P (A) = p c ( A) (27) 

is called measurement independence in [17, 18], and is dis¬ 
cussed in [19]. It is argued that this assumption encodes 
the “free will” of the experimenter in choosing measure¬ 
ment settings. Thus, the use of different Kolmogorov 
probability spaces to model different and incompatible 
(sub)experiments is somehow seen as violating the as¬ 
sumption that observers have “free will”. 

First we note that even when equation (27) holds, Bell- 
type inequalities will generally be violated if the prob¬ 
abilities involved are interpreted as conditional proba¬ 
bilities [9, 15, 20]. Furthermore, absolute relative fre¬ 
quency probabilities are necessarily absolute Kolmogoro- 
vian probabilities and therefore they must satisfy Bell- 
type inequalities. This means that quantum probabili¬ 
ties violating Bell-type inequalities cannot be interpreted 
as absolute frequency probabilities [15, 20]. A violation 
of Bell-type inequalities can only be explained through 
some sort of conditional probability model [15, 20]. 
The challenge one is faced with in constructing a de¬ 
terministic hidden-variable model, is the reproduction 
of the quantum predictions using either conventional 
event-conditioned classical probabilities, or perhaps us¬ 
ing context-conditioned classical probabilities [32]. Re¬ 
garding the latter, one can simply interpret “measure¬ 
ment dependence” as the assumption that incompati¬ 
ble experimental contexts require separate probability 
spaces. This is justified through the assumption that 
experimental results are the product of the system and 
the apparatus. 

Consider, for example, a situation in which a physical 
system S is prepared at time —to and that this results 
in some ontic (possibly unknown) state A(—to)- Suppose 
that at time t = 0 the system is to be measured for some 
duration T, with a device M whose state A c is always 
known and can be controlled by the experimenter. In 
many cases a reasonable assumption might be that the 
state of the macroscopic measuring device is stationary 

A c (t) = A c . 

We can now define a “free will” condition as the as¬ 
sumption that A c and A are independent variables for 
t < 0. This means that A c which can be chosen at any 
time t < 0 is not constrained by the system S. How¬ 
ever, during the measurement process S and M will cer¬ 
tainly interact, so there is no good reason to assume 
that A(t) will be independent of A c for t > 0. The 


probability density p(A(t)) over ontic states taken at the 
time of the measurement therefore depends on A c (as 
well as A(—to) and other initial data). We can therefore 
write p(A(t)) = p(A(-f 0 ), A c , t) = p c (X(-t 0 ),t) for t > 0. 
Thus, if the probability to obtain a particular measure¬ 
ment result is calculated from a density over ontic states 
taken at the time of the measurement , then the probabil¬ 
ities will be contextual. This does not violate “free will”, 
but simply indicates that the system and measurement 
device have interacted, so the result is not “measurement 
independent”. 

The above qualitative analysis implies that the measur¬ 
ing device generally disturbs the system being measured 
and so it is not passive within the experiment. The as¬ 
sumption of a passive measurement device may often be 
made when dealing with macroscopic systems in classical 
physics, but this assumption is not a fundamental pos¬ 
tulate of the latter, and it seems ill-justified when con¬ 
sidering microscopic systems. Indeed, the assumption 
that measurement devices are always passive in classical 
physics introduces a kind of classical measurement prob¬ 
lem in which measurement devices are given a privileged 
role as special systems that do not disturb the systems 
they interact with. 

IV. RELATION TO CONVENTIONAL 
APPROACHES 

In this section we attempt to assess how the rigorous 
treatment in III A, the contextual treatment in III B, and 
conventional hidden-variable treatments found through¬ 
out the physics literature, are each related to one another. 
Throughout the physics literature pertaining to Bell-type 
inequalities, one frequently encounters heuristic expres¬ 
sions for what is known as Bell’s locality assumption writ¬ 
ten in the form 

P\(j, k\a, b) = p\(j\a)p\(k\b). (28) 

Here j and k are left and right-particle spin outcomes 
respectively, a and b are left and right-SG device settings 
respectively, and as usual A is supposed to offer a com¬ 
plete description of the underlying state of affairs. The 
labels 1 and 2 in (28) refer to the left and right-particles 
respectively. This particular notation was borrowed from 
[21]. Sometimes one sees A appearing as a conditioning 
event as in the following expression, borrowed from a re¬ 
cent article [22] 

p{j, k\a , b) = J dX p(X)p(j , k\a, b , A). (29) 

In our notation (28) would read 

Pab{A = j,B = k\X) = p a (A = j\X)p b (B = fc|A) (30) 

though we have not yet offered an interpretation of the 
conditioning of our probabilities on an underlying phys¬ 
ical state A. This will be done in what follows. 


A. The role of conditional probabilities 

The probabilities on the left-hand-side in (29) are our 
primary interest as these are the probabilities appear¬ 
ing in Bell-type inequalities. Let us therefore focus on 
the left-hand-side of (29). Suppose that we treat the de¬ 
vice settings a and b as genuine conditioning events in 
an event space Q. This, after all, is what the notations 
in (28) and (29) suggest we should do. As soon as we 
choose to do this it becomes impossible to avoid includ¬ 
ing a description of the SG devices within the probabilis¬ 
tic treatment of the experiment. Proceeding along this 
line of inquiry, our aim in this section is to relate the 
expressions in (28) and (29) to the expressions derived in 
IIIB. To this end we interpret the left-hand-side of (29) 
as being given by (24); 

Pab{A = j,B = k) = Pabtia O k b \£l a D Cl b ). (31) 

Now we would like to determine whether or not the right- 
hand-side of (29) can be understood as being equal to 
the right-hand-side of (31). The probabilities in (28) 
and (29) are supposed to be conditioned on the settings 
of the SG devices. This gives the impression that the 
states of the measuring devices have been properly taken 
into account. However, the expressions obtained do not 
represent true conditional probabilities, because they are 
not normalised as such. Rather they are normalised by 
p(fl) = 1. Perhaps the conditioning is supposed to have 
been taken care of by the hidden-variables A. Let us 
investigate this possibility. 

In the rigorous measure-theoretic formulation of Kol¬ 
mogorov probability theory (c.f. [23-25]) a probability 
density is associated with a real random vector A : H —> 
M", which defines a measure P\ := p o A -1 : H(R”) —> 
[0,1] where H(K n ) is the set of Borel subsets of R n . Note 
that the domain of P\ implies that A -1 : £>(R”) —y Sq 
denotes the pre-image map, which is well-defined whether 
or not A is invertible. In turn one can define an absolutely 
continuous distribution F\(X) := Pa(B\) = p(A < A) 
where B\ := (—oo, Ai] x ... x (—oo,A n ] £ £>(K"). The 
associated density is defined as p\ ■= d n F\/dXi...dX n . 
Formally pa (A) can be thought of as the probability that 
A = A, and could be denoted p(A = A). One can ex¬ 
tend the construction to deal with conditional proba¬ 
bilities by defining for S = A ~ 1 {B) £ £q the measure 
ps p(-|<S) := p{- fl S)/p(S). With this one can define 
a measure Pa\b over H(R") as 

Pa\b{B') := p(A~ 1 (B , )\A~ 1 (B)) = -±- [ dp 

M*- 5 ! JA-Rsnsq 

= <32) 
Then one defines the conditional distribution Fa\b(X) := 
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Pa\b(B\) and the associated density 


Pa|b(A) 


d n F A \ B (\) _ p A {X)5 B (X) _ p A {X)S B 

d\ 1 ...ax n ~ p A (B) - f B d n \p A (\) 

(33) 


where S B is the Dirac measure associated with B ; 


Sb( A) 


1 A £ B , 
0 X£ B. 


(34) 


According to this definition pa|b(A) is normalised to 
unity indicating that it is a genuine probability density. 
Equation (33) yields for a subset S' = A ~ 1 (B') £ Eq 
with B' £ 2?(R), the following expression for a condi¬ 
tional probability written in terms of the corresponding 
density; 


p(S'\S) = P A (B'\B) = 


Pa(b' n B) 
Pa{B) 


J B 'nB dnX P^ X ) 

J B d n Xp A (X) 



Finally, we can define the conditional probability 
Pa(B\X) by 


P a (B|A)pa(A) = P A (B) Pmb (X) (36) 

whenever p A (X) ^ 0 almost everywhere (except on a set 
of measure zero). Comparing this expression with (33) 
yields 


P a [B\X) = S B (X). (37) 

This result is precisely what we should expect to find, 
because we are assuming that A represents the precise 
state of affairs within the underlying physical reality. If 
we are given A we can be certain which events will and 
will not occur. The quantity P A (B\X) represents the 
probability of event B given A. Thus, P A (B\X) must 
be either 0 or 1. More precisely, the event B is certain 
to occur if A £ B and certain not to occur otherwise, 
hence P A (B\X) = Sb(X). Another way to view P A (B\X) 
is as the probability of B given that the epistemic state 
p A is the delta function p A (X r ) = S(X — A'), which cor¬ 
responds to the situation in which we have complete 
knowledge of the underlying reality. Borrowing terminol¬ 
ogy from quantum theory, delta-function epistemic states 
Pa(X') = <5(A — A') could be termed pure states , with all 
other epistemic states termed mixed states. These pure 
states are clearly in one-to-one correspondence with the 
ontic states A. Now, from either (37), or directly from 
(36), it follows that 

Pa(B) = J d n X PA (X)P A (B\X) = J^d n Xp A (X) (38) 

of which the first equality appears to be quite close to 
what we see in (29). Before we use the above formalism in 


analysing the conventional hidden-variable approaches, 
it may be instructive to see it in action using a simple 
example. 

Consider a point particle moving in one-dimensional 
Euclidean space E 1 . With initial conditions the second 
order dynamics resulting from Newton’s laws defines a 
well-posed Cauchy problem. A position value and veloc¬ 
ity (momentum) value suffice to give a complete physi¬ 
cal description of the particle. Thus, the state (event) 
space is SA = T*E 1 , which denotes the cotangent bun¬ 
dle of E 1 . We can turn the manifold T*E 1 into a Kol¬ 
mogorov space by equipping it with a Kolmogorov mea¬ 
sure p : Y,T*Eq —> [0,1] where T, t * e i is a suitable a- 
algebra. Since T*E 1 is a flat manifold it admits a global 
coordinate chart A : T*E 1 — > R 2 , which is associated 
with some family of observers O. A particle state is a 
phase point ui £ T*E l with coordinate representation 
A(w) = (x,p) £ R 2 relative to O. A particle observable is 
a suitably well-behaved (e.g. smooth, square-integrable) 
function / : T*^ 1 — > R, which admits the coordinate rep¬ 
resentation / := fo A -1 : R 2 —► R. Finally P A := po A -1 
defines a normalised Kolmogorov measure on R 2 and an 
epistemic state associated with the observers O is an in¬ 
tegrate density p A such that 

P A {U)= f dx dpp A [x,p) (39) 

Ju 

where U := A (U) and U £ E t . b i . Different choices of p A 
correspond to different choices of P A , which correspond 
to different choices of p. This means that different choices 
of p A encode different epistemic states of Kolmogorovian 
uncertainty regarding the ontic state of the particle ui £ 
T*E 1 . The average value of observable / measured by 
observers O with epistemic state p A is 


(/) := [ dxdpp A (x,p)f(x,p). (40) 

Jr 2 

The probability that relative to O the particle’s state 
belongs in U := A(U) is 


p{U) = P A (U) 


dxdp p A {x,p) 


/r 2 


dxdpSjj(x,p)p A (x,p). 


(41) 


If p A (x,p) = 8(x — Xo)S(p — po) the observers know with 
certainty that the state of the particle has coordinate 
representation (xq,pq) relative to them. They therefore 
know with certainty whether or not the state belongs in 
U, i.e., whether or not (xo,Po) belongs in U. For this 
particular epistemic state the probability in (41) is equal 
to one if (xo,po) 6 U and equal to zero otherwise, i.e., 

Pa{U) = / dxdp5jj{x,p)5(x - x 0 )6(p - p 0 ) 

Jr 2 

= 8fj(x 0 ,po) = Pa(U\xq,po)- (42) 
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For general p Aj (41) can be written 


Pa(U) 



dx dp p A (x,p)Sfj(x,p) 



dx dp p A {x , p)P{U\x, p) 


IV, which leads to (37), is general enough to include any 
ontic state of the type encountered in classical physics. 
But if this is indeed our only option, then A-conditioned 
probabilities will always trivially satisfy Bell’s locality 
assumption. 


which is just a specific example of expression (38). 


V. EXAMPLE 


B. Comparison of conventional and contextual 
approaches 

Having determined with the formalism above precisely 
how to interpret probabilities that are conditioned on the 
ontic state A and having seen this formalism work in a 
simple setting, we now have all the ingredients we need 
in order to compare conventional approaches with our 
contextual Dempster-Shafer approach. First we use (37) 
to interpret the probabilities in (30) as 

Pab{A = j 7 B = k\\) =5~. an ~ kb (X), (44) 

which using (25) and (38) yields 

p ab (A = j,B = k) = J d n \p' ab WPab(A=j,B = k\\). 


In this section we examine a model of EPRB-type ex¬ 
periments that fits within our contextual approach. The 
model is quite similar to a model originally proposed by 
Barut and Meystre [28] , and is designed to reproduce the 
quantum predictions. 

A. The model 

To begin with we consider one particle and two orienta¬ 
tions c = a, a' of an SG device. For simplicity we restrict 
the model to two spatial dimensions (the 22 -plane). The 
spin observable in the c-direction C =: — > {±} is 

defined by 

CM = ( + £ + “ (46) 

— ui c £ — c - 


(45) 

Equation (45) is similar to (29), but of course the prob¬ 
abilities in (45) are defined differently to those in (29). 
Unlike those in (29), the probabilities in (45) do not obey 
Bell-type inequalities. The main difference between (45) 
and (29) is that in the latter the labels a, b pertaining 
to the measurement devices appear to be merely nota- 
tional tools, while in the former they take an active role 
in defining the probability measures P ab (and hence p a b ), 
and in properly normalising the resulting probabilities. 

The question arises as to the sense in which our ap¬ 
proach can be considered local. Although Bell considered 
his locality assumption (28) crucial for the proof of his 
theorem, it was noted some time ago by Accardi, that 
the crucial assumption underlying Bell-type inequalities 
is the counterfactual assumption not Bell’s locality as¬ 
sumption [26, 27]. In fact Accardi has shown that Bell’s 
inequalities hold for spin random-variables defined over a 
single Kolmogorov space, even if one assumes the nega¬ 
tion of his locality assumption [26] . 

The idea underlying our contextual framework is that 
observed physical events in an experiment are dependent 
on both the measured systems and the measurement de¬ 
vices. While this invalidates the counterfactual assump¬ 
tion, Bell’s locality assumption (30) remains perfectly in¬ 
tact, and is satisfied as an immediate consequence of (44) 
and the identity Sadb = 5a5b- Moreover, given that A 
is supposed represent the complete ontic state of the sys¬ 
tem, it is difficult to see how to interpret probabilities 
conditioned on A in any other way besides as Dirac mea¬ 
sures of the form 5a (A). Indeed the formalism of section 


We consider a classical particle with angular-momentum 
spin unit-vector S, which can be fully specified by an 
angle A relative to the 2 -axis. We fix the unit vector 
c = a, a' giving the alignment of the SG device, which 
can be fully specified by an angle A c . We define the chart 
A c : il c —>■ R by 

A.M = ()■)” (47) 

I A — A_ c U) c t — c- 

We now choose the sets ± c such that together (46) and 
(47) yield the representation C : Cl c —> {±} defined by 


C( A) 


A A — A c £ [ 7t/2,7t/2) 

- A - A_c £ [— tt/2, 7t/2) . 


(48) 


Since A_ c = A c + 7r, the sets ± c are given by 



(49) 


which respectively represent the north and south hemi¬ 
spheres of the sphere for which c specifies the north pole. 
The set U c = A c U — c evidently represents to the whole 
sphere. 

The density associated with the measure P c for this 
example is defined as 


p c { A) := N | cos (A - A c )| 


N cos(A — A c ) A £ Ac 
—N cos(A — A c ) A £ — c 
(50) 
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where N is an arbitrary normalisation constant. The 
associated nomalised density is therefore 


\ cos (A - A c ) As + c 
-Acos(A-Ac) A G — c . 


(51) 


The factor cos(A — A c ) gives the component of S in the c- 
direction whenever S belongs to the northern hemisphere 
+c- Likewise — cos(A — A c ) gives the component of S 
in the direction —c whenever S belongs to the southern 
hemisphere — c . Thus, the probability densities to obtain 
the values + and — are proportional to the component 
of the spin in the positive and negative direction of the 
SG alignment. Note that the density in (50) is positive 
semi-definite on the domain of certainty fI c . Using (22) 
and (50) we obtain the single-particle probabilities 


Pc (±) = P c (± c |f2 c ) = P'(± c ) = i (52) 


The average spin measured when the SG alignment is c 
is 

{C) c = [ dX Pc (X)C(X) = dXp c (X)C(X ) = 0. 

(53) 


The probabilities p c (±) = 1/2 are the same as the quan¬ 
tum probabilities 

2ty(±) : = \ (! P c - ( ct1 ® /2 )b) = \ ( 54 ) 

where cr 1 denotes the Pauli operator-valued three-vector 
for the particle being considered, and I 2 denotes the 2- 
dimensional identity operator on the Hilbert space of a 
second particle. The average in (54) is taken in the two- 
particle entangled (Bell) state 

W “ (its ) 1 ® Its ) 2 - Its ) 1 ® Its) 2 ) (55) 

and like (C) c in (53) it is equal to zero. 

In this example there is a clear distinction between 
the spin of the particle in the c-direction S c and the spin 
observable in the c-direction, which is C S c . This is be¬ 
cause what is observed in the experiment depends on the 
entire experimental arrangement, including the measur¬ 
ing device. The observable C represents a joint system- 
apparatus observable and the results are “ regarded as the 
joint product of system and apparatus... 14]. It is inter¬ 
esting to note that in a sense this is also how the quantum 
formalism works. The state of the particle pair is given 
by (55) with the label S referring to the particles, but 
with no explicit reference being made to the alignment c 
of either SG device. Meanwhile the observables (projec¬ 
tions) 7T± C := (1 =p c • cr)/2 clearly refer to both a particle 
and its measuring device. 

Let us now consider the full two-particle EPRB setup. 
Along with the first SG device measuring the left- 
particle, we consider a second SG device with possible 


alignments 6, b' measuring the right-particle. The total 
spin of the particle pair is zero, so the right-particle has 
spin —S, which is opposite to that of the left-particle. 
Thus, for the right-particle the spin observable in the 
context c has value + when — S has positive component 
in the direction c, and has value — otherwise; 


C(X) = 



X - A_ c G [-71-/2,71-/2) 
A - A c G [—7r/2, 7 t/2) 


(56) 


where C = B, B'. The sets ± c , c = b,b' are given by 


71- 37T 

x c + -,x c + — 


7T 71" 

A C --,A C + - 


)■ 


(57) 


We now define the normalised joint-context probability 
densities p' cc , with c = a, a 1 and d = 6, b' by 

P'cc'W = UPcW + Pp'c'W ( 58 ) 

where a, /3 G [0,1] are constants such that a + /3 = 1. 
For a particular choice of a and /3 the above definition 
uniquely specifies p' cc , on the whole domain f2 c nfl c /. The 
fact that p' cc , can be expressed as a normalised combina¬ 
tion of single-particle densities seems similar to the quan¬ 
tum superposition principle, but actually apart from the 
requirement that a + /3 = 1 one has complete freedom in 
the choice of a and j3. 

Despite the fact that the EPRB-setup involves two par¬ 
ticles, there is only one unknown quantity A. The proba¬ 
bility p a b{j , k) can be thought of as the probability that A 
belongs to both j„ and kb , so it is obtained by integrating 
the density p' a over the intersection j a D kb. Since p' b is 
obtained from p' a by a shift in coordinate axes and an al¬ 
ternative choice of a and /3, p a b{j, k) can also be obtained 
by integrating p' b over j a Akb- To understand this freedom 
note that since the context ab is the simultaneous reali¬ 
sation of contexts a and b the observables A, B and AB 
are trivially compatible. We can therefore say that the 
contexts a, b and ab are themselves compatible—they can 
be simultaneously realised. Indeed ab can be viewed as a 
restriction of either one of the contexts a or b due to the 
other one. This is reflected by the domains of certainty 
through the relations D a fl Dt, C D a and D a fl fib C fV 
The densities p' a and p' b quantify the (Kolmogorovian) 
epistemic uncertainty in A within the contexts a and b re¬ 
spectively, i.e., over the domains fl a and f l b respectively. 
Since p' a and p' b quantify the same uncertainty relative to 
different domains, the joint probabilities found using p' a 
and p' b should coincide on the common domain fl a D STfo. 
Because this is the case p' ab is uniquely interpretable as 
quantifying the uncertainty in A within the context ab. 

In the case c = a, d = b (58) yields 

Pab{+ , +) = ^ [1 - cos(A a - A b )] = Pab(-,~) 

Pab(+, -) = ^ [1 + cos(A a - Ab)] = Pab{~, +), (59) 
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which are the same as the quantum predictions obtained 
using the Bell state (55). The average (AB) ab can be 
calculated from the probabilities above as 

{AB) ab = ^ ^Pab(j =k)~ ^Vabij ± k) 

= — cos(A a — Ab) = —a ■ b, (60) 

which is the same as the corresponding quantum expecta¬ 
tion value. It should be clear from this that the present 
model completely reproduces the quantum predictions, 
but is also completely classical and deterministic. 

We note finally that if one chooses the normalisation 
N = 1/8 in (50) then P C (Q C ) = 1/2 can be viewed as 
the probability that the SG device setting c is selected. 
Similarly one can define P cc i such that P cc i (f l c n fl c /) = 
P c (fl c )P c i(fl c i) = 1/4 can be viewed as the probability 
that c and c' are selected. This normalisation is appropri¬ 
ate for modeling Aspect’s experiment in which switches 
randomly flip between polarisers a and a' as well as be¬ 
tween b and b' during the propagation of the light mea¬ 
sured [16]. The probabilities in (52) and (62) can then be 
understood as the conditional probabilities for outcomes 
given particular device settings are selected. 


B. On the question of locality 

We consider here the status of the above model with 
regard to the question of locality. 

1. Bell’s locality 

Bell’s locality assumption is satisfied in the form 

Pab{j,k\X) = S- jan ~ kb (X) = Sj a (X)6 kb (A) = p a (j\X)pb(k\X) 

(61) 

where the sets ± a are defined in (49), and the sets ±6 
are defined in (57). The above is of course just a special 
case of (30), which holds generally within our contextual 
framework. 


2. Einstein locality 

We comment now on another notion of locality. The 
model above could be deemed to be local if in the two- 
particle case, all single-particle probabilities are identical 
to those obtained in the single-particle case. This condi¬ 
tion is called Einstein causality , also known as parameter 
independence [21], Einstein locality [29], or simply local¬ 
ity [29, 30]. It refers to the impossibility of superluminal 
signaling. Parameter independence is satisfied within the 
above model. To see this note that the single-particle 
probabilities in (52) refer to a particle for which the spin 
is measured along the a, a' directions, with no reference 


being made to any other particles or any other SG de¬ 
vices. In the model of the full two-particle EPRB-setup, 
single-particle probabilities for the left-particle are given 
for c = a, a' by 

Pc(j |A) = ^ Pcb{j, fc|A) = 'Y^Pcb'{j,k\X), 

/c=± fc=± 

Pc{j) = ^2 Pcb ti ’ = Pcb '(i ’( 62 ) 

k—zL k—ziz 

and the probabilities p c (j) above are identical to those 
found in (52). Thus, the probabilities pertaining to the 
left-particle obtained by considering the full two-particle 
setup, are identical to those obtained in the case that no 
right-particle exists. In particular the probabilities on the 
left-hand-side in (62) are independent of the context label 
c' = 5, b' chosen for the right-particle. The above result 
is analogous to the “no-signalling” theorems of quantum 
theory [31]. 


3. Discussion 

There are numerous situations in classical physics in 
which spacelike separated events are statistically corre¬ 
lated. This instantaneous action-at-a-distance is associ¬ 
ated with logical inference rather than causal influence, 
so no violations of causality ensue. For example, if the 
spin of the left-particle S is found to be + in the di¬ 
rection a, then we know with certainty that the spin of 
the right-particle is — in the direction a, and this logical 
inference propagates instantaneously. Outcome indepen¬ 
dence is the condition of statistical independence given 
the underlying ontic state A; 

Pa 01 a) = Pa 01 a, k), p\{k\b) = p\(k\j, b) (63) 

where we have again borrowed the notation from [21]. 
Together, parameter independence and outcome inde¬ 
pendence are equivalent to Bell’s locality assumption [30]. 

Any theory consistent with special relativity must sat¬ 
isfy Einstein causality. However, it appears that quan¬ 
tum theory violates outcome independence, where clas¬ 
sical theory cannot. This means that quantum theory is 
nonlocal in a sense that classical theory is not. In the 
Copenhagen interpretation of quantum theory, a particle 
can only be said to possess a definite spin value if it is 
in an eigenstate of the appropriate spin operator. In this 
sense quantum theory is indeterministic. As a result, in 
quantum theory a violation of outcome independence oc¬ 
curs when a measurement is made on the left-particle in 
the Bell state (55), because this measurement physically 
determines the spin of the right-particle through the col¬ 
lapse of the composite superposition (55). Prior to the 
measurement of the left-particle, the right-particle could 
not be said to possess a definite spin value, because the 
Bell state (55) is not an eigenstate of the right-particle’s 
spin operator. This nonlocal action is viewed as the cause 
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of the correlations predicted, which cannot then be in¬ 
terpreted as due to mere logical inferences. This is what 
Einstein described as “spooky action-at-a-distance”. 

Since experiments violate the CHSH inequalities, it is 
alleged that the “spooky action-at-a-distance” of quan¬ 
tum theory must be accepted. There are two fallacies 
from which this conclusion results. The first is that 
Bell-type inequalities rest upon Bell’s locality assump¬ 
tion, and in particular on outcome independence. The 
second is that classical theories cannot violate Bell-type 
inequalities. Of course, quantum theory is (by definition) 
non-classical. What is more, in the Copenhagen inter¬ 
pretation it is not deterministic, and it violates outcome 
independence. It also happens to violate Bell-type in¬ 
equalities. However, this does not imply that the only 
way to violate Bell-type inequalities is to use a non- 
classical, nonlocal, indeterministic theory. Actually as we 
have shown, one can reproduce the quantum predictions 
with a completely deterministic, local, classical theory, 
provided one adopts a sufficiently refined view towards 
measurement. As a result the fallacious conclusion that 
Bell’s inequalities necessarily impose constraints on clas¬ 
sical physics should be summarily disregarded. 

The example above is purely classically deterministic, 
so one can argue that in this context the only meaning¬ 
ful notion of locality is Einstein causality, i.e., parameter 
independence. One does not need to distinguish between 
two types of locality, and explanations for certain corre¬ 
lations as resulting from a “spooky action-at-a-distance” 
are unnecessary. Having eliminated the false idea that 
classical correlations cannot violate Bell-type inequali¬ 
ties one is free to interpret the correlations observed in a 
purely classical way. 

VI. CONCLUSIONS 

We have introduced an alternative probabilistic model 
of EPRB-type experiments using multi-valued maps of 
the kind found in Dempster-Shafer probability theory 
[7, 8]. This has lead to a contextual approach, which 
appears to be similar in nature to Khrennikov’s [6, 9]. 
Our approach gives an explicit link between measurement 


contexts and counterfactual outcomes. The uncertainty 
associated with the latter is non-Kolmogorovian epis- 
temic uncertainty in the underlying event space fl, which 
is divided up according to different measurement contexts 
using the multi-valued maps. These concepts have been 
applied to both phenomenological and hidden-variable 
treatments of Bell-type inequalities. In both cases the 
counterfactual assumption is seen to be invalid. The en¬ 
suing non-Kolmogorovian epistemic uncertainty leads to 
context conditioned probabilities that do not obey Bell- 
type inequalities. 

The conditioning of probabilities on measurement con¬ 
texts can be viewed as the result of a formulation in which 
the measurement devices are not passive, but active in 
determining physical events. This idea of what should 
be taken to constitute a “measurement” was advocated 
by Bell long ago [14], What appears not to have been 
recognised by many physicists is that this has implica¬ 
tions for classical as well as quantum physics. It implies 
that the restriction to a single Kolmogorov probability 
space should not be viewed as a necessity for a classical 
description. Kolmogorov probability theory and classi¬ 
cal physics are two distinct theories—one is a theory of 
probability and one is a theory of physics. Although in 
some cases they may be inter-related, it is a mistake to 
conflate assumptions about the former with assumptions 
about the latter. This is precisely the mistake one makes 
in concluding that Bell-type inequalities constrain classi¬ 
cal physics. 

While the context-conditioned probabilities we have 
obtained do not obey Bell-type inequalities, there is noth¬ 
ing about our approach which precludes classical, deter¬ 
ministic, local physical models. In particular we have 
constructed a local and deterministic classical model, 
which reproduces the quantum-mechanical predictions. 
From this point of view, the incorrect conclusion that vi¬ 
olations of Bell-type inequalities rule out deterministic, 
local, classical theories simply results from an oversim¬ 
plified conception of measurement. 
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